Linear prediction incorporating simultaneous masking

نویسندگان

Jason Lukasiak

Ian S. Burnett

Joe F. Chicharo

M. M. Thomson

چکیده

Whilst linear prediction is the cornerstone of most modern speech coders, few of these coders incorporate the perceptual characteristics of hearing into the calculation of the linear predictor coefficients (LPCs). This paper proposes a method of incorporating simultaneous masking into the calculation of the LPCs. This modification requires only a modest increase in computational complexity and results in the linear predictor removing more perceptually important information from the input speech signal. This results in a filter that better models the formants of the input speech spectrum. The net effect is that an improvement in quality is achieved for a given bit rate or alternately a bit rate reduction can be achieved while maintaining perceived quality. These results have been confirmed through subjective listening tests. Disciplines Physical Sciences and Mathematics Publication Details This paper originally appeared as: Lukasiak, J, Burnett, IS, Chicharo, JF & Thomson, MM, Linear prediction incorporating simultaneous masking, ICASSP '00. Proceedings. 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, 5-9 June 2000, vol 3, 1471-1474. Copyright IEEE 2000. This conference paper is available at Research Online: http://ro.uow.edu.au/infopapers/218 LINEAR PREDICTION INCORPORATING SIMULTANEOUS MASKING J. Lukasiak, IS. Burnett, J . F. Chicharo, M.M. Thomson * Whisper Laboratories, TITR University of Wollongong Wollongong, NSW, Australia, 2522 *Motorola Australian Research Centre, Botany, NSW, Australia, 201 9

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Low rate speech coding incorporating simultaneously masked spectrally weighted linear prediction

Linear prediction (LP) is the cornerstone of most modern speech compression algorithms. Previously it has been shown that incorporating a weighting function based on the simultaneous masking property of the ear into the calculation of the LP coefficients (SMWLPC) allows the filter to better model the unmasked sections of the input spectrum. This paper conducts a detailed analysis of the impleme...

متن کامل

Perceptual wavelet packet audio coder

Traditional wavelet packet audio compression algorithms do not utilize the temporal masking properties of the human auditory system, relying instead on simultaneous masking models. This paper presents the design and implementation of a perceptual wavelet audio coder by incorporating temporal and simultaneous masking models. The efficiency of the encoder was assessed based upon the number of bit...

متن کامل

Masking by inaudible sounds and the linearity of temporal summation.

Many natural sounds, including speech and animal vocalizations, involve rapid sequences that vary in spectrum and amplitude. Each sound within a sequence has the potential to affect the audibility of subsequent sounds in a process known as forward masking. Little is known about the neural mechanisms underlying forward masking, particularly in more realistic situations in which multiple sounds f...

متن کامل

Single channel speech enhancement by frequency domain constrained optimization and temporal masking

A speech enhancement algorithm is proposed that exploits the masking properties of the human auditory system. The enhancement is formulated as a frequency domain constrained optimization problem. The noise components of the noisy speech are suppressed by a gain function subject to the constraint that both the signal distortion and residual noise should fall below the masking thresholds. Tempora...

متن کامل

Linear and Nonlinear Processes in Temporal Masking

A number of masking phenomena can be modeled in terms of a linear auditory filter bank followed by a temporal integrator and a simple decision device based on the signal-to-masker ratio. Other aspects require the inclusion of a nonlinearity following linear filtering. The present article concentrates on aspects of non-simultaneous, or “temporal”, masking that cannot be explained by either model...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2000

Linear prediction incorporating simultaneous masking

نویسندگان

چکیده

منابع مشابه

Low rate speech coding incorporating simultaneously masked spectrally weighted linear prediction

Perceptual wavelet packet audio coder

Masking by inaudible sounds and the linearity of temporal summation.

Single channel speech enhancement by frequency domain constrained optimization and temporal masking

Linear and Nonlinear Processes in Temporal Masking

عنوان ژورنال:

اشتراک گذاری